<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd"><html xmlns="http://www.w3.org/1999/xhtml"><head><title>R: Edgeworth's Data on Death Rates in British Counties</title>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
<link rel="stylesheet" type="text/css" href="R.css" />
</head><body>

<table width="100%" summary="page for EdgeworthDeaths"><tr><td>EdgeworthDeaths</td><td style="text-align: right;">R Documentation</td></tr></table>

<h2>
Edgeworth's Data on Death Rates in British Counties
</h2>

<h3>Description</h3>

<p>In 1885, Francis Edgeworth published a paper, <em>On methods of ascertaining variations in the rate of births, deaths and marriages</em>.
It contained among the first examples of two-way tables, analyzed to show variation among row and column factors,
in a way that Fisher would later formulate as the Analysis of Variance. 
</p>
<p>Although the data are rates per 1000, they provide a good example of a two-way ANOVA with n=1 per cell,
where an additive model fits reasonably well. 
</p>
<p>Treated as frequencies, the data is also a good example of a case where the independence model fits
reasonably well.
</p>


<h3>Usage</h3>

<pre>data("EdgeworthDeaths")</pre>


<h3>Format</h3>

<p>A data frame with 42 observations on the following 3 variables.
</p>

<dl>
<dt><code>County</code></dt><dd><p>a factor with levels <code>Berks</code> <code>Herts</code> <code>Bucks</code> <code>Oxford</code> <code>Bedford</code> <code>Cambridge</code></p>
</dd>
<dt><code>year</code></dt><dd><p>an ordered factor with levels <code>1876</code> &lt; <code>1877</code> &lt; <code>1878</code> &lt; <code>1879</code> &lt; <code>1880</code> &lt; <code>1881</code> &lt; <code>1882</code></p>
</dd>
<dt><code>Freq</code></dt><dd><p>a numeric vector, death rate per 1000 population</p>
</dd>
</dl>



<h3>Details</h3>

<p>Edgeworth's data came from the Registrar General's report for the final year, 1883.
The <code>Freq</code> variable represents death rates per 1000 population in the six counties listed.
</p>


<h3>Source</h3>

<p>The data were scanned from Table 5.2 in 
Stigler, S. M. (1999) <em>Statistics on the Table: The History of Statistical Concepts and Methods</em>,
Harvard University Press.
</p>


<h3>References</h3>

<p>Edgeworth, F. Y. (1885). On Methods of Ascertaining Variations in the Rate of Births, Deaths, and Marriages. 
<em>Journal of the Statistical Society of London</em>, 48(4), 628-649. doi:10.2307/2979201
</p>


<h3>Examples</h3>

<pre>
data(EdgeworthDeaths)

# fit the additive ANOVA model
library(car)  # for Anova()
EDmod &lt;- lm(Freq ~ County + year, data=EdgeworthDeaths)
Anova(EDmod)

# now, consider as a two-way table of frequencies

library(vcd)
library(MASS)
structable( ~ County + year, data=EdgeworthDeaths)
loglm( Freq ~ County + year, data=EdgeworthDeaths)

mosaic( ~ County + year, data=EdgeworthDeaths, 
	shade=TRUE, legend=FALSE, labeling=labeling_values, 
	gp=shading_Friendly)


</pre>


</body></html>
